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© The invention relates to ribose- and 2-deoxyribose compounds having the general formula 
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The definitions of the compounds is given in the claims. 
_ Applications include detection and localization of polynucleotide sequences in chromosomes, fixed cells, 
StissuVsections. and cell extracts. Specific applications include chromosomal karyotyping, climcaJ d.agnos.s of 
Snucleic acid-containing etiological agents, e.g. bacteria, viruses, or fungi, and diagnosia of genetic disorders. 
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RIBOSE- AND 2-DEOXYRIBOSE COMPOUNDS 



Many procedures employed in biomedical research and recombinant DNA technology rely heavily on 
the use of nucleotide or polynucleotide derivatives radioactively labeled with isotopes of hydrogen ( 3 H), 
phosphorous Pp), carbon 0*C), or iodine ( 1Z5 I). Such radioactive compounds provide useful indicator 
probes that permit the user to detect, monitor, localize, or isolate nucleic acids and other molecules of 
s scientific or clinical interest, even when present in only extremely small amounts. To date, radioactive 
materials have provided the most sensitive, and in many cases the only, means to perform many important 
experimental or analytical tests. There are. however, serious limitations and drawbacks associated with the 
use of radioactive compounds. First, since personnel who handle radioactive material can be exposed to 
potentially hazardous levels of radiation, elaborate safety precautions must be maintained during the 
ro preparation, utilization, and disposal of the radioisotopes. Secondly, radioactive nucleotides are extremely 
expensive to purchase and use, in large part due to the cost of equipment and manpower necessary to 
provide the appropriate safeguards, producer/user health monitoring services, and waste-disposal programs. 
Thirdly, radioactive materials are often very unstable and have a limited shelf-life, which further increases 
usage costs This instability results from radiolytic decomposition, due to the destructive effects associated 
j 5 with the decay of the radioisotope itself, and from the fact that many isotopes (e.g. M p and 125 l) have half- 
lives of only a few days. ... , u 
It is known that haptens can combine with antibodies, but can initiate an immune response only if 
bound to a carrier. This property can be exploited in detection and identification testing. 

It is also known that biotin and iminobiotin strongly interact with avidin. a 68.00 dalton glycoprotein from 
20 egg white. This interaction exhibits one of the tightest, non-covalent binding constants (K^IO"' 5 ) seen in 
nature If avidin is coupled to potentially demonstrable Indicator molecules, including fluorescent dyes. e.g. 
fluorescein or rhodamine; electron-dense reagents. e.g. ferritin, hemocyanin. or colloidal gold; or enzymes 
capable of depositing insoluble reaction products. e.g. peroxidase or alkaline phosphatase, the presence, 
location or quantity of a biotin probe can be established. Although iminobiotin binds avidin toss tightly than 
as biotin similar reactions can be used for its detection. Moreover, the reversibility of the iminoblotin-avidin 
interaction, by decreasing solution pH, offers significant advantages in certain applications. 

The specificity and tenacity of the biotin-avidin complex has been used in recent years to develop 
methods for visually localizing specific proteins, lipids, or carbohydrates on or within cells (reviewed by EJV. 
Bayer and M. Wilchek in Methods of Biochemical Analysis. 26. 1. 1980). Chromosomal location of RNA has 
30 been determined by electron microscopy using a biotinized protein, cytochrome C, chemically cross-linked 
to RNA as a hybridization probe. The site of hybridization was visualized through the binding of avidin- 
ferritin or avidin-methacrylate spheres mediated by the avidin-biotin interaction (J.E. Manning. N.D. Her- 
shey TR Broker. M. Pellegrini. H.K. Mitchell, and N. Davidson. Chromosoma, S3. 107, 1975; J.E. Manning, 
M Pellegrini, and N. Davidson. Biochemistry. 61, 1364, 1977; T.R. Broker. L.M. Angerer. P.H. Yen, N.D. 
35 Hershey. and N. Davidson. Nucleic Acid Res., 5. 363, 1978; A. Sodja and N. Davidson, Nucleic Acid Res.. 
5 383 1978). This approach to the detection* of polynucleotide sequences, although successful in the 
specialized cases examined which were highly reiterated sequences, is not of general utility for analysis of 
polynucleotides present in single or low copy number. ^ 

Moreover, methods for attaching chemical moieties to pyrimidine and purine rings are known. Several 
40 years ago a simple and rapid acetoxymercuration reaction was developed for Introducing covalently bound 
mercury atoms into the 5-position of the pyrimidine ring, the C-8 position of the purine ring or the C-7 
position of a 7-deazapurine ring, both in nucleotides and polynucleotides. (R.M.K. Dale. D.C. Livingston and 
DC Ward. Proc. Natl. Acad. Sci. U.S.A.. 70. 2238. 1973; R.M.K. Dale. E. Martin. D.C. Livingston and D.C. 
Ward Biochemistry. 14. 2447. 1975.) It was also shown several years ago that organomercurial compounds 
45 would react with oleflric compounds in the presence of palladium catalysts to form carbon-carbon bonds 
(R F Heck J Am. Chem. Soc.. 90. 5518. 1988; R.F. Heck. Ibid.. 90. 5526. 1968; R.F. Heck. Ibid.. 90. 5531. 
1968- RF Heck. Ibid., 90. 553571968; and R.F. Heck. J. Am. Chem.Soc. 91. 6707. 1969.) Bergstrom and 
associates (J.L. Ruth and D.E. Berstrom, J. Org. Chem.. 43. 2870. 1978; and D.E. Bergstrom and M.K. 
Ogawa. J. Am. Chem. Soc, 100. 8106. 1978) and Bigge. et al. (C.F. Bigge. P. Kalaritjs. J.R. Deck and M.P. 
50 Mertes J. Am. Chem. Soc., 102 2033, 1980) have recently applied this reaction scheme in the synthesis of 
C-5 substituted pyrimidine nucleotide compounds. 

Finally, it is known that antibodies specific for modified nucleotides can be prepared and used for 
isolating and characterizing specific constituents of the modified nucleotides. (T.W. Munns and M.K. 
Liszewski. Progress in Nucleic Acid Research and Molecular Biology. 24. 109, 1980.) However, none of the 
antibodies prepared to date against naturally occurring nucleotides have been shown to react with their 
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nucleotide determinant when it xists in a double-stranded RNA and DNA duplex or when in DNA-RNA 
hybrid molecules. 

To circumvent the limitations of radioactively labeled probes or previously utilized chemical and 
biological probes, a series of novel nucleotide derivatives that contain biotin, iminobiotin, lipoic acid, and 

5 other determinants attached covalently to the pyrimidine or purine ring have been synthesized. These 
nucleotide derivatives, as well as polynucleotides and coenzymes that contain them, will interact specifically 
and uniquely with proteins such as avidin or antibodies. The interaction between modified nucleotides and 
specific proteins can be utilized as an alternative to radioisotopes for the detection and localization of 
nucleic acid compoments in many of the procedures currently used in biomedicai and recombinant-DNA 

10 technologies. Methods employing these modified nucleotide-protein interactions have detection capacities 
equal to or greater than procedures which utilize radioisotopes and they often can be performed more 
rapidly and with greater resolving power. 

These new nucleotide derivatives can be prepared relatively inexpensively by chemical procedures 
which have been developed and standardized as discussed more fully hereinafter. More significantly, since 

75 neither the nucleotide probes of this invention nor the protein reagents employed with them are radioactive, 
the compounds can be prepared, utilized, and disposed of, without the elaborate safety procedures required 
for radioisotopic protocols. Moreover, these nucleotide derivatives are chemically stable and can be 
expected to have functional shelf-lives of several years or more. Finally, these compounds permit the 
development of safer, more economical, more rapid, and more reproducible research and diagnostic 

20 procedures. 

The invention relates to ribose- and 2-deoxyribose compounds having the general formula 



25 



30 



35 



40 



46 



x-CH 2 o 



OH Z 
* # 

wherein B represents a 7-deazapurine or a pyrimidine moiety covalently bonded to the C 1 -position of the 
sugar moiety, provided that when B is a 7-deazapurine. the sugar moiety is attached at the ^-position of 
the deazapurine. and when B is a pyrimidine, the sugar moiety is attached at the N^posrtion of the 
pyrimidine; 

wherein A represents a ligand containing at least three carbon atoms; 

wherein the dotted line represents a linkage group joining B and A. the linkage group including carbon- 
carbon single bonds, carbon-carbon double bonds, carbon-nitrogen single bonds or carbon-oxygen single 
bonds, provided that if B is a 7-deazapurine, the linkage group is attached to the 7-position of the 
deazapurine, and if B is a pyrimidine, the linkage group is attached to the 5-position of the pyrimidine; 
wherein X represents a moiety selected from: 



HO 



H0-|-O-P-O-, or H0-?-O-^-0-S-0-, 
OH OH OH OH OH OH 



and 

50 wherein Z is -OH or -H. 

Claims 2 to 44 relate to further embodiments of the invention. 
These compounds may be prepared by a process which involves: 
(a) reacting a compound having the structure: 

56 
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with a mercuric salt in a suitable solvent under suitable conditions so as to form a mercurated compound 
10 having the structure: 



15 



X-CH- v 0 



B-Hg* 



20 



(b) reacting said mercurated compound with a chemical moiety reactive with the -Hg portion of said 
mercurated compound and represented by the formula •••N, said reaction being carried out in an aqueous 
solvent and in the presence of KaPdCU under suitable conditions so as to form a compound having the 
structure: 



30 



X-CH 2 



y » 



35 



40 



wherein N is a reactive terminal functional group or is A; and 

(c) recovering said compound as said modified nucleotide when N is A, or when N is a reactive 
terminal group, reacting said compound with a compound having the structure M-A, wherein M represents a 
functional group reactive with N in an aqueous solvent under suitable conditions so as to form said modified 
nucleotide, which is then recovered. 

This invention also provides compounds having the structure: 



so 



55 
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40 



30 wherein each of B, b\ and B" represents a purine. 7-deazapurine. or pyrimidine moiety covalentiy bonded 
to the C 1 -position of the sugar moiety, provided than whenever B, B , or B is purine or 7-deazapurine, it is 
attached at the N^-position of the purine or 7-deazapurine, and whenever B,B,orB is pyrimidine, it is 
attached at the N 1 -position; 

wherein A represents a moiety consisting of at least three carbon atoms which is capable of forming a 
35 detectable complex with a polypeptide when the compound is incorporated into a double-stranded duplex 
formed with a complementary ribonucleic or deoxyribonucleic acid molecule. 

wherein the dotted line represents a chemical linkage joining B and A, provided that if B is purine the 
linkage is attached to the 8-position of the purine, if B is 7-deazapurine, the linkage is attached to the 7- 
position of the deazapurine, and if B is pyrimidine, the linkage is attached to the 5-position of the 

pyrimidine; 
wherein z represents H- or HO-; and 

wherein m and n represent integers from 0 up to about 100,000. 

These compounds can be prepared by enzymatic polymerization of a mixture of nucleotides which 

include the modified nucleotides of this invention. Alternatively, nucleotides present in oligo- or poly- 
45 nucleotides may be modified using chemical methods. 

Nucleotides modified in accordance with the practices of this invention and oligo- and polynucleotides 

into which the modified nucleotides have been incorporated may be used as probes in biomedical research. 

clinical diagnosis, and recombinant DNA technology. These various utilities are based upon the ability of the 

molecules to form stable complexes with polypeptides which in turn can be detected, either by means of 
50 properties inherent in the polypeptide or by means of detectable moieties which are attached to, or which 

interact with, the polypeptide. . 
Some uses include detecting and identifying nucleic acid-containing etiological agents, e.g. bacteria and 

viruses; screening bacteria for anitbiotic resistance; diagnosing genetic disorders, e.g. thalassemia and 

sickle cell anemia; chromosomal karyotyping; and identifying tumor cells. 
55 Several essential criteria must be satisfied in order for a modified nucleotide to be generally suitable as 

a substitute for a radioactively-label d form of a naturally occurring nucleotide. First, the modified 

compound must contain a substituent or probe that is unique, i.e., not normally found associated with 

nucleotides or polynucleotides. Second, the probe must r act specifically with chemical or biological 
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reagents to provide a sensitive detection system. Third, the analogs must be relatively efficient substrates 
for commonly studied nucleic acid enzymes, since numerous practical applications require that the analog 
be enzymatically metabolized, e.g., the analogs must function as substrates for nucleic acid polymerases. 
For this purpose, probe moieties should not be placed on ring positions that sterically, or otherwise, 
interfere with the normal Watson - Crick hydrogen bonding potential of the bases. Otherwise, the 
substituents will yield compounds that are inactive as polymerase substrates. Substitution at ring positions 
that alter the normal "anti" nucleoside conformation also must be avoided since such conformational 
changes usually render nucleotide derivatives unacceptable as polymerase substrates. Normally, such 
considerations limit substitution positions to the 5-position of a pyrimidine and the 7-position of a punne or a 

7-deazapurine. . . 

Fourth the detection system should be capable of inter acting with probe substituents incorporated into 
both single-stranded and double-stranded polynucleotides in order to be compatible with nucleic acid 
hybridization methodologies. To satisfy this criterion, it is preferable that the probe moiety be attached to 
the purine or pyrimidine through a chemical linkage or "linker arm" so that it can readily interact with 
antibodies, other detector proteins, or chemical reagents. 

Fifth the physical and biochemical properties of polynucleotides containing small numbers of probe 
substituents should not be significantly altered so that current procedures using radioactive hybridization 
probes need not be extensively modified. This criterion must be satisfied whether the probe is introduced 
by enzymatic or direct chemical means. 

Finally the linkage that attaches the probe moiety should withstand all experimental conditions to which 
normal nucleotides and polynucleotides are routinely subjected, e.g.. extended hybridization times at 
elevated temperatures, phenol and organic solvent extraction, electrophoresis, etc. 

All of these criteria are satisfied by the modified nucleotides described herein. 

These modified nucleotides have the structure: 




wherein B represents a purine, 7-deazapurine, or pyrimidine moiety covalentjy bonded to the C -position of 
the sugar moiety, provided that when B is purine or 7-deazapurine. it is attached at the N s -position of the. 
purine of 7-deaza purine, and when B is pyrimidine. it is attached at the N 1 -position; 
wherein A represents a moiety consisting of at least three carbon atoms which is capable of forming a 
detectable complex with a polypeptide when the compound is incorporated into a double-stranded 
ribonucleic acid, deoxyribonucleic acid duplex, or DNA-RNA hybrid; 

wherein the dotted line represents a linkage group ioining B and A, provided that if B is purine the linkage is 
attached to the 8-position of the purine, if B Is 7-deazapurine, the linkage is attached to the 7-position of the 
deazapurine, and if B is pyrimidine, the linkage Is attached to the 5-position of the pyrimidine; and 
wherein each of x, y and z represents 



P 0 0 0 0 0 

r. -~ ~ U (( II II tl || 

H0-, H0-P-O-, HO-P-0-P-0-, or HO-P-O-P-O-P-O- . 

OB . OH OH OH OH OR 



These compounds are widely useful as probes in biomedical research and recombinant DNA technol- 

09V Although in principal all compounds encompassed within this structural formula may be prepared and 
used in accordance with the practices of this invention, certain of the compounds are more readily prepared 
or used or both, and therefore are presently preferred. 

Thus, although purines, pyrimidines and 7-deazapurines are in principal useful, pyrimidines and 7- 



EP 0 329 198 A2 



deazapurines are preferred since purine substitution at the 8-position tends to render the nucleotides 
ineffective as polymerase substrates. Thus, although modified purines are useful in certain respects, they 
are not as generally useful as pyrimidines and 7-dea2apurines. Moreover, pyrimidines and 7-deazapurines 
useful in this invention must not be naturally substituted at the 5- or 7- positions, respectively. As a result, 
certain bases such as thymine, 5-methylcytosine, and 5-hydroxymethylcytosine are not useful. Presently 
preferred bases are cytosine, uracil, dea2aadenine and deazaguanine. 

A may be any moiety which has at least three carbon atoms and is capable of forming a detectable 
complex with a polypeptide when the modified nucleotide is incorporated into a double-stranded duplex 
containing either deoxyribonucleic or ribonucleic acid. 

A therefore may any ligand which possesses these properties, including haptens which are only 
immunogenic when attached to a suitable carrier, but are capable of interacting with appropriate antibodies 
to produce complexes. 

Examples of moieties which are useful include: 




50 Of these the preferred A moieties are biotin and iminobiotin. 

Moreover, since aromatic moieties tend to intercalate into a base-paired helical structure, it is preferred 
that the moiety A be non-aromatic. Also, since smaller moieties may not permit sufficient molecular 
interaction with polypeptides, it is preferred that A be at least Cs so that sufficient interaction can occur to 
permit formation of stable complexes. Biotin and iminobiotin satisfy both of these criteria. 

55 The linkage or group joining moiety A to base B may include any of the well known bonds including 
carbon-carbon single bonds, carbon-carbon double bonds, carbon-nitrogen singl bonds, or carbon-oxygen 
single bonds. However, it is generally preferred that the chemical linkage include an olefinic bond at the «- 
p sition r lative to B. The presence of such an a-olefinic bond serves to hold the moiety A away from the 
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base when the base is paired with another in the well known double-helix configuration. This permits 
interaction with polypeptide to occur more readily, thereby facilitating compi x formation. Moreover, single 
bonds with greater rotational freedom may not always hold the moiety sufficiently apart from the helix to 
permit recognition by and complex formation with polypeptide. 

It is even more preferred that the chemical linkage group be derived from a primary amine, and have 
the structure -CH 2 -NH-, since such linkages are easily formed utilizing any of the well known amine 
modification reactions. Examples of preferred linkages derived from allylamine and allyl-(3-amino-2- 
hydroxy-1 -propyl) ether groups have the formulae 
-CH = CH-CH 2 -NH- and 
-CH = CH-CH2-OCH2- CH -CH2-NH-, 

OH 

respectively. . , , , * .. . 

Although these linkages are preferred, others can be used, including particularly olefin linkage arms 
with other modifiable functionalities such as thiol, carboxylic acid, and epoxide functionalities. 

The linkage groups are attached at specific positions, namely, the 5-position of a pyrimidine, the 8- 
position of a purine, or the 7-position of a deazapurine. As indicated previously, substitution at the 8-position 
of a purine does not produce a modified nucleotide which is useful in all the methods discussed herein. It 
may be that the 7-position of a purine, which is occupied by a nitrogen atom, could be the point of linkage 
attachment However, the chemical substitution methods employed to date and discussed herein are not 

suitable for this purpose. , , 

The letters x f y, and z represent groups attached to the 5 , 3 . and 2 positions of the sugar moiety. 

They may be any of 

0 op goo 



u w u u ii it 

HO-, H0-P-O-, HO-P-O-P-O-, or HO-P-O-P— P-0-. 

\ I I III 

OH OB OB OB OB OB 



Although conceivable, it is unlikely that ail of x, y ( and z will simultaneously be the same. More likely at 
least one of x. y. and z will be a phosphate-containing group, either mono-, di-, or tri-phosphate and at least 
one will be HO- or Vk As will be readily appreciated, the most likely identity of z will be HO- or H- 
indicating ribonucleotide or deoxyribonucleotide, respectively. Examples of such nucleotides include 5 - 
ribonucleoside monophosphates, 5-ribonucleoside diphosphates, 5 -ribonucleoside triphosphates, 5-deox- 
yribonucieoside monophosphates, 5 -deoxyribonucleoside diphosphates, 5 -deoxyribonucleoside 
triphosphates, s'p- ribonucleoside-3p. and 5 p-deoxyribonucleoside-3 p. More specific examples include 
modified nucleotides of this type in which A is biotin or iminobiotin, the chemical linkage is 
-CH = CH-CH 2 -NH- or 
-CH = CH-CH 2 -0-CH 2 - CH -CH 2 -NH-, 
OH 

and B is uracil or cytosine. 

The general synthetic approach adopted for introducing the linker arm and probe moiety onto the base 
is discussed hereinabove. (See especially, J.L Ruth and D.E. Bergstrom. J. Org. Chem.. 43, 2870, 1978; 
D E Bergstrom M.K. Ogawa, J. Amer. Chem. Soc. 100, 8106, 1978; and C.F. Bigge. P. Kalaritis, J.R. Deck 
and MP. Mertes, J. Amer. Chem. Soc. 102, 2033, 1980.) However, the olefin substituents employed herein 
have not been used previously. To facilitate attachment of probe moiety A, it has been found particularly 
desirable to employ olefins with primary amine functional groups, such as allylamine [AA] or al!yl-(3-amino- 
2-hydroxy-1 -propyl) ether [NAGE], which permit probe attachment by standard amine modification reac- 
tions, such as, 
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-Cfl2 NH 2 + R-C-OR -CH2NHCR 
Znidat 



10 



.-CH2NH2 + R-J 
O 



^0.— *> •CH 2 NHCR 



;5 



Anhydride 



20 

•CH2NH2 



• [ NOCB — * -CB.flHCK 
NHS-ester ( N-hydroxysucciniinide) 



25 



s 
a 

-CH2NH2 + R-K=»C»S-> -CH2NHCNKR 
30 Isothiocyanate 

0 <^ 
35 -GH 2 NH 2 + ZA-R -CH 2 NaCH 2 CS ^ 

. Epoxide 

Because of ease of preparation it has been found preferable to use NHS-esters for probe addition. However, 
olefin linker arms with other modifiable functional groups, such as thiols, carboxylic acids, epoxides, and the 
like, can also be employed. Furthermore, both linker arm and probe can be added m a single-step if 
deemed desirable. 
45 Specifically, modified nucleotides having the structure: 



50 



55 wherein B represents a purine. 7-deazapurine. or pyrimidine moiety covalently bonded to the C 1 -position , of 
the sugar moiety, provided that when B Is purine or 7-deazapurine, it is attached at the N'-pos.t.on of the 
purine or deazapurine. and when B is pyrimidine, ft is attached at the N'-position; 

wherein A represents a moiety consisting of at least three carbon atoms which is capable of forming a 
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detectable complex with a polypeptid when the compound is incorporated into a double-stranded 
ribonucleic acid, deoxyribonucleic acid duplex, DNA-RNA hybrid; 

wherein the dotted line represents a chemical linkage joining B and A, provided that if B is purine, the 
linkage is attached to the 8-position of the purine, if 7-deazapurine, the linkage is attached to the 7-position 
of the deazapurine, and if B is pyrimidine, the linkage is attached to the 5-position of the pyrimidine; and 
wherein each of x, y, and z represents 



70 



o j> o <f o o 

HO-/ HO-P-0-, KO-P-0-P-0-, or HO-P-0-P-0-P-0-, 
OH OH 6H 6H OH OH 



can be prepared by: 
16 (a) reacting a compound having the structure: 



20 



«* y z 



with a mercuric salt in a suitable solvent under suitable conditions so as to form a mercurated compound 
having the structure: 



30 




y 



(b) reacting said mercurated compound with a chemical moiety reactive with the -Hg + portion of said 
mercurated compound and represented by the formula •**N, said reaction being carried out in an aqueous 
solvent and in the presence of KaPdCU under suitable conditions so as to form a compound having the 
structure: 

B"*N 

* 



y * 




wher in N is a reactive terminal functional group or is A; and 

(c) recovering said compound as said modified nucleotide when N is A, or when N is a reactive 
terminal group, reacting said compound with a compound having the structure M-A, wherein M represents a 
functional group reactive with N in an aqueous solvent under suitable conditions, so as to form said 
modified nucleotide, which is then recovered. 

The following schema is illustrative: 
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40 



C- CH 2 
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55 



Although the reactions can be carried out at hydrogen ion concentrations as low as pH 1, or as high as pH 
14 it is preferred to operate in the range from about 4 to 8. This is especially true when dealing with 
unstable compounds such as nucleoside polyphosphates, polynucleotides, and nucleotide coenzymes 
which are hydrolyzed at pH's outside this range. Similarly, it is preferred to operate at a temperature in the 
ranqe from about 20* C to 30* C to avoid possible decomposition of labile organic substrates. However, 
the reactions can be carried out at temperatures from about 5 C to 100 C. As is usual iwtth chem.caJ 
reactions, higher temperatures promote the reaction rate and lower temperatures retard it Thus, in the 
temperature range from 5* C to 100* C. the optimum reaction time may vary from about 10 minutes to 98 
hours. In the preferred temperature range, reaction times normally vary from about 3 to 24 hours. 

The preferred procedure for maintaining the pH in the desired range is through the use of buffers. A 
variety of buffers can be employed. These include, for example, sodium or potassium acetate, sodiurn or 
potassium citrate, potassium citrate-phosphate, tris-acetate and borate-sodium hydroxide buffers. The 
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concentration of buffer, when employed, can vary over a wide range, up to about 2.0 molar. 

While a particular advantage of the mercuration and palladium catalyzed addition reactions is that they 
can be carried out in water, small amounts of an organic solvent can be usefully included as a solubility aid. 
The organic solvents usually chosen are those which are miscible with water. These may be selected from 
ethers alcohols, esters, ketones, amides, and the like such as methanol, ethanol. propanol. glycenn. 
dioxane, acetone, pyridine and dimethylformamide. However, since it has been observed that the presence 
of alcohols, such as methanol, often results in alkoxy-additlon across the olefin double bond, any organic 
solvent used as a solubility aid should be chosen carefully. Introduction of alkoxy substituents to the <*- or 
0- exocyclic carbon atoms often results in the production of compounds which are utilized much less 
efficiently as enzyme substrates. . 

Although various mercuric salts may be utilized, the presently preferred salt is mercunc acetate. Also, 
as indicated previously, the compounds may be prepared by first adding a linker arm and then the moiety 
A or by adding a linker arm to which A is already attached. Thus, the chemical moiety represented by the 
formula " 'N may be any one of the numerous entities which ultimately result in production of the desired 

compounds. 

Examples include -CH = CH-CH 2 -NH 2 , 
-CH = CH-CH 2 -0-CH 2 - CH -CHz-NHz. 



i 

OH 



-CH=CH-CH 2 -NH-biotin, and 
-CH = CH 2 -CH 2 -0-CH 2 - Ch -CHz-NH-iminobiotin. 



I 

OH 



The amounts of the reactants employed in these reactions may vary widely. However, in genera) the 
amounts of unmercurated compound, mercurated compound, and palladium-containing compound will be 
substantially stoichiometric whereas the mercuric salt and compound —N will be present in molar excess, 
eg 5-20 moles of -"N or of mercuric salt per mole of mercurated compound or unmercurated 
compound, respectively. In practice, amounts will vary depending upon variations in reaction conditions and 
the precise identity of the reactants. * 

Having the biotin probe directly attached to nucleotide derivatives that are capable of functioning as 
enzyme substrates offers considerable versatility, both in the experimental protocols that can be performed 
and in the detection methods (microscopic and non-microscopic) that can be utilized for analysis. For 
example biotin nucleotides can be introduced into polynucleotides which are in the process of being 
synthesized by cells or crude cell extracts, thus making it possible to detect and/or isolate nascent 
(growing) polynucleotide chains. Such a procedure is impossible to do by any direct chemical modification 
method. Furthermore, enzymes can be used as reagents for introducing probes such as biotin into highly 
selective or site-specific locations in polynucleotides; the chemical synthesis of similar probe-modified 
products would be extremely difficult to achieve at best. 

The synthesis of nucleotides containing biotin or iminobiotin was achieved as detailed in the examples 
set forth hereinafter. Pyrimidine nucleoside triphosphates containing either of these probes attached to the 
C-5 carbon atom were good to excellent substrates for a wide variety of purified nucleic acid polymerases 
of both prokaryotic and eukaryotic origin. These include DNA polymerase I or E. coli. bacteriophage T4 
DNA polymerase, DNA polymerases a and from murine (A-9) and human (HeLa) cells, and the DNA 
polymerase of Herpes simplex virus. Confirming data were obtained with E. Coli DNA polymerase I using 
either the nick-translation condition of Rigby. et al. (P.W.J. Rigby, M. Dieckmann. C. Rhodes and P. Berg. J. 
Mol Biol 113, 237. 1977) or the gap-filling reaction described by Bourguignon et al. {Q.J. Bourguignon. P. 
J TattersaTand D.C. Ward. J. Virol. 20. 290. 1976). Bio-dUTP has also been found to function as a 
polymerase substrate both in CHO cells, permeabilized by treatment with lysolecithin according to the 
method of Miller, et al. (M.R. Miller, J.C. Castellot. Jr. and A.B. Pardee. Exp. Cell Res. 120. 421. 1979) and 
in a nuclear replication system prepared from Herpes simplex Infected BHK cells. Although biotmyl 
ribonucleoside triphosphates were found to function as substrates for the RNA polymerases of E. Coh and 
bacteriophage T7. they are not utilized as efficiently as their deoxyribonucleotide triphosphate counterparts. 
Indeed, they are incorporated poorly, if at all. by the eukaryotic RNA polymerases examln ed (HeLa ceU 
RNA polymerase III, calf thymus RNA polymerase II and mouse cell RNA polymerase II). While this limited 
range of substrate function does restrict the utility in some in vivo or in vitro transcription studies, biotJn- 
labeled RNA probes can be prepared enzymatically from DNA templates using E. coli or T7 RNA 
polymerases or by 3 end-labeling methods using RNA ligase with compounds such as biotinyl-pCp. The 
AA- and NAGE-derivatives of UTP are, however, substrates for the eukaryotic RNA polymerases mentioned 
above. With the availability of antibodies to these analogs, the isolation of nascent transcripts by im- 
munological or affinity procedures should be feasible. 



12 



EP 0 329 198 A2 



70 



16 



The enzymatic polymerization of nucleotides containing biotin or iminobiotin substituents was not 
monitored directly, since neither of these probes were radiolabeled. However, two lines of experimental 
evidence clearly show that the biotinyl-nucleotides were incorporated. The first is that polynucleotides 
synthesized in the presence of biotin-nucleotides are selectively retained when chromatography over 
avidin or streptavidin affinity columns. (Tables I and II). For example, whereas normal DNA, nick translated 
with 32p-dAMP. is quantitatively eluted upon the addition of 0.5 M NaCI. the vast majority of b.otinyl-DNA or 
iminobiotinyl-ONA remains bound to the resin even after extensive washing with high salt. urea, quanidme- 
HCI formamide or 50 mM NaOH. The small fraction of the radiolabeled eluted by these washing conations 
is not retained when applied to the resin a second time, suggesting that radioactivity is associated with DNA 
fragments which are free of biotin substitution. The second line of evidence is that only biotn-labeled 
polynucleotides are immunoprecipitated when treated with purified anti-biotin IgG followed by formalin-fixed 
btephylococcus aureus. (Table III). It is clear from the data in these tables that extremely small amounts of 
biotin can be detected by this method. These results also show that the biotin molecule can be recognized 
by avidin. streptavidin or specific antibodies while the DNA is still in its native, double-stranded form a 
condition that is absolutely essential if the antibody-binding or avidin-affinlty approaches are to be useful in 
probe detection employing hybridization techniques. 



TABLE I 



SELECTIVE RETENTION OF BIOTINIZED DNA ON 
AVIDIN-SEPHAROSE 


Eluent 




% DNA Retained on 
Resin 






Bio-DNA 
(1%) 


T-DNA 


Load 

(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 


- 3 x 10 s cpm 

10mMTris7.5 + 0.2 M NaCI 

0.5 M NaCI 

1.0 M NaCI 

8 M Urea 

6 M guanidine-HCI 

99 % formamide 

2 mM Biotin 

50 mM NaOH 


100 

100 
99.7 

100 
95.2 
94.7 
97.6 
89.5 


100% 
0.1 
<0.1 
<0.01 
<0.01 
<0.01 
<0.01 
<0.01 



TABLE II 

to 



Affinity Chromatography of Iminobiotin-dUTP and Iminobiotmized - DNA on 
Streptavidin-Sepharose 


Eluent 




% Retained on SA-Sepharose 






T-DNA 


3H-IB-dUTP 


IB-DNA 


Load 

(D 
(2) 
(3) 

w 

(5) 
(6) 


-10mM Tris-HCI, 8.3 
50 mM NaCI 
0.1 M NaCI 
1.0 M NaCI 
8 M Urea 

6 M guanidine-HCI 

50 mM NH+-acetate, pH 4.0 

50 mM NHi-acetate, pH 4.0 2 mM biotin 


8.7 
<0.1 
<0.01 
<0.01 
<0.01 
<0.01 
<0.01 


100 

100 

100 
97.5 
97.0 
<0.01 
<0.01 


99.7 
99.7 
99.4 
98.5 
97.0 
96.5 
<0.01 
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TABLE III 



SELECTIVE IMMUNOPRECIPITATION OF BIO-DNA WITH 


ANTI-BIOTIN IgQ and STAPH AUREUS 


DNA* 


Antibody 


CPM in 


CPM in 






Immuno ppt. 


Supernatant 


T-DNA 




70 


4867 


T-DNA 


Anti-Bio IgG 


87 


5197 


T-DNA 


Non-immune IgG 


55 


5107 


Bio-DNA 




53 


3886 


Bio-DNA 


Anti-Bio IgG 


3347 


736 


Bio-DNA 


Non-immune IgG 


60 


3900 



*N.T. pBR-322 DNA. 32p -labeled; 1% Biotin substitution. Specific activity, 2 x 
10 7 cpm/ug Biotin detection 0.001-0.01 pmoles. 



Thus, it is possible to prepare novel compounds having the structure: 



20 



? 



I 

OH 



25 



30 




36 



--O-CH 



OB 



40 



45 




-OH 



so 



55 



wherein each of B, b', and b" represents a purine, deazapurine, or^yrimidine moiety covalentty bonded to 
the C -position of the sugar moiety, provided that whenever B, B , or B is purine or 7-deazapunne, it Is 
attached at the N 9 -positJon of the purine of deazapurine, and whenever B, B , or B is pyrimidine, it is 
attached at the N 1 -position; 

wherein A represents a moiety of consisting of at least three carbon atoms which is capable of forming a 
detectable complex with a polypeptide when the compound is incorporated into a double-stranded duplex 
formed with a complementary ribonucleic or deoxyribonucleic acid molecule; 

wherein the dotted line represents a linkage group joining B and A, provided that if B is purine, the linkage 
is attached to the 8-position of the purine, if B is7-d azapurine. the linkage is attached to the 7-posrtion of 
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the deazapurine, and if B is pyrimidine, the linkage is attached to the 5-position of the pyrimidin ; 
wherein z represents H- or HO; and 

wherein m and n represent integers from 0 up to about 100,000. 

Of course, it should be readily understood that in general m and n will not simultaneously be 0 since, in 

5 that event, the compound becomes merely a modified nucleotide as described pr viously. In general B and 
B" will vary within the same oligo- or polynucleotide, being alternatively uracil, cytosine, thymine, guanine, 
adenine or the like. Also, in general, the variation will correspond to the ordered sequence of nucleotides 
which codes for the synthesis of peptides according to the well known Genetic Code. However, it is 
intended that the structure shown also embrace polynucleotides such as poly C, poly U, poly r(A-U). and 

io poly d(A-U) as well as calf thymus DNA, ribosomal RNA of E. coli or yeast, bacteriophage RNA and DNA 
<R17. fd). animal viruses (SV40 DNA), chromosomal DNA. and the like, provided only that the poly- 
nucleotides be modified in accordance with this invention. 

It is also to be understood that the structure embraces more than one modified nucleotide present in 
the oligomer or polymer, for example, from two to thirty modified nucleotides. The critical factor in this 

75 regard Is that the number of modifications not be so great that the polynucleotide is rendered ineffective for 

the intended use. . . 

Finally, it should be understood that modified oligo- and polynucleotides can be joined to form larger 
entities having the same structure so long as terminal groups are rendered compatible or reactive. 

These compounds can be made by enzymatic polymerization of appropriate nucleotides, especially 

so nucleotide triphosphates in the presence of a nucleic acid template which directs synthesis under suitable 
conditions. Such conditions can vary widely depending upon the enzyme employed, amounts of nucleotides 
present and other variables. Illustrative enzymes include DNA polymerase I of E. coli. bacteriophage T4 
DNA polymerase, DNA polymerases a and fi from murine and human (HeLa) cells. DNA polymerase from 
Herpes simplex virus, RNA polymerase of E. coli, RNA polymerase of bacteriophage 17, eukaryotic RNA 

as polymerase including HeLa cell RNA polymerase III, calf thymus RNA polymerase II, and mouse cell RNA 

polymerase II. , ^ . L . 

Also the compounds can be prepared by terminal addition to oligo- or polynucleotides to produce 

compounds in which m or n is 0 depending upon whether the addition is at the 5 or 3 position. Moreover. 

the compounds such as pCp or pUp in which the base is biotinized can be added to existing molecules 
30 employing the enzyme RNA ligase. 

< Modified oligo- and polynucleotides can also be prepared by chemical modification of existing oligo- or 

polynucleotides using the approach described previously for modification of individual nucleotides. 

The various modified nucleotides, oligonucleotides, and polynucleotides of this invention may be 

detected by contacting the compounds with polypeptides which are capable of forming complexes therewith 
35 under suitable conditions so as to form the complexes, provided that the polypeptides Include one or more 

moieties which can be detected when the complex or complexes is or are formed, generally by means of 

conventional detection techniques. 

One polypeptide detector for the biotinyl-type probe is avidin. The avidin-biotin interaction exhibits one 

of the tightest non-covalent binding constants (K^IO" 15 ) seen in nature. If avidin is coupled to potently 
40 demonstrable indicator molecules. e.g.. fluorescent dyes (fluorescein, rhodamine). electron-dense reagents 

(ferritin, hemocyanin, colloidal gold), or enzymes capable of depositing insoluble reaction products 

(peroxidase, alkaline phosphatase) the presence, location and/or quantity of the biotin probe can be 

established. 

Avidin has. unfortunately, one property that makes it less desirable as a biotin-indicator protein when 
45 used in conjunction with nucleic acids or chromatin material. It has been reported (M.H. Heggeness. Stain 
Technol.. 52, 165, 1977; M.H. Heggeness and J.F. Ash. J. Cell. Biol., 73. 783. 1977; E.A. Bayer and M. 
Wllchek. Methods of Biochemical Analysis 26. 1. 1980) that avidin binds tightly to condensed chromatin or 
to subcellular fractions that contain large aSiounts of nucleic acid in a manner which is independent of its 
biotJn-binding property. "Since avidin is a basic glycoprotein with a pi of 10.5, its histone-like character or its 

so carbohydrate moieties are most likely responsible for these observed non-specific interactions 

A preferred probe for biotin-containing nucleotides and derivatives is streptavidin, an avidin-like protein 
synthesized by the soil organism Streptomyces avidinii . Its preparation and purification is described in 
Hoffman, et al.. Proc. Natl. Acad. Sci.. 77, 4666 (1980). Streptavidin has a much lower pi (5.0), is non- 
glycosylated, and shows much lower nonspecific binding to DNA than avidin. and therefore offers potential 
55 advantages in applications involving nucleic acid detection methodology. 

A most preferred protein for biotin-like probe detection is monspecific rabbit IgG. antibiotin im- 
munoglobulin. This compound was prepared by immunizing rabbits with bovine serum album.n conjugated 
biotin as described previously (M. Berger. Methods in Enzymology. 62, 319 [1979]) and purified by affinity 
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chromatography. Although the association constant of immunoglobulin-haptens have values of Kassn (10 6 to 
10 10 ) which are considerably lower than for avidin-biotin complexes, they are substantially equivalent to 
those observed with the avidin-iminobiotin complex. Furthermore, the anti-biotin antibodies have proven 
extremely useful in detecting specific polynucleotide sequences on chromosomes by in situ hybridization 
since little, if any, non-specific binding of the antibody to chromatin material occurs. 

The modified polynucleotides of this invention are capable of denaturation and renaturation under 
conditions compatible with their use as hybridization probes. An analysis of the thermal denaturation profiles 
and hybridization properties of several biotin-substituted DNA and RNA polymers clearly indicates this. For 
example, pBR 322 DNA or X DNA, nick translated to introduce approximately 10-100 biotin residues per 
kilobase, have Tm values essentially identical to that of the control, biotin-free DNAs. Furthermore, 32p - 
labeled, biotin-substituted, pBR 322 DNA, exhibited the same degree of specificity and autoradiographic 
signal intensity as control, thymidine-containing DNA, when used as a hybridization probe for detecting 
bacterial colonies containing the plasmid. 

in DNA duplexes, such as MVM RF DNA, in which every thymidine residue in one strand (1250 in toto 
per 5 Kb) is replaced by a biotinyl-nucleotide, the Tm is only 5* C less than that of the unsubstituted 
control. Although the Tm of poly d(A-bioU) in which ease base pair contains a bio-dUMP residue is 15 C 
lower than the poly d(A-T) control, the degree of cooperative and the extent of hyp erchromicity observed 
both during denaturation and renaturation were the same for the two polymers. A parallel analysis of RNA 
duplexes and DNA/RNA hybrids indicates that their Tm's also decrease as the biotin-content of the polymer 
increases. However, it is clear that a substantial number of biotin-molecules can be introduced without 
significantly altering the hybridization characteristics of the polymer. 

These results strongly suggested that biotin-substituted polynucleotides could be used as probes for 
detecting and/or localizing specific polynucleotide sequences in chromosomes, fixed cells, or tissue 
sections. 

The general protocol for detecting the biotin-substituted probe is schematically illustrated as follows: 



30 
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GENERAL PROTOCOL FOR PROBE DETECTION! 
VIA IN 5"/n/, COLONY, OR NORTHERN /SOUTHERN 

hybridizaYion METHODS 



Antt probe sequence 

..J, ,:,i,i.:r.r :hi • ■ ,tttv 



I) Target- 
. Oelivery 



Hybridize with bioti'nized or 
hoptenized probe (with or with* 
v out cloning ve chicle sequences) 




6 6 6 6 6 



2) Signai 
.. Amplification 




-O* Biotin or 
Haptene 



I} Avidin -peroxidase 
2) IgG - peroxidase 
v 3} Primary cr-determinent Ig 6 




in l2) <»2p 



3) D elections I). insoluble peroxidase products' DAB 
2) Antibody sandwiching techniques 



This general scheme illustrates only procedures used for gene mapping (cytogenetics), and recom- 
binant DNA-technologies. However, it can be equally well applied to the detection of nucleic acid sequences 
of bacterial, viral, fungal or parasite origin in clinical samples and this forms the basis of a powerful new 
approach to clinical diagnostics which doss not rely on the use of radioisotopes. 

Immunological and histochemical methods for the detection of biotin have shown that the basic 
approach is useable for a rapid method of gene mapping in situ hybridization and non-radioactive 
procedures for detecting specific nucleic acid sequences by blotting hybridization methods. Use may be 
made of this technology in development of new clinical diagnostic procedures. 

Using this approach. It is possible to determine the presence of a specific deoxyribonucleic or 
ribonucleic acid molecule, particularly such a molecule derived from a living organism, e.g. bacteria, fungus, 
virus, yeast, or mammal. This in turn permits diagnosis of nucleic acid-containing etiological agents in a 
patient or other subject 

Moreover, it provides a method for screening bacteria to determine antibiotic resistance. Thus, for 
example, penicillin resistance in Streptococcus pyogenes or Neisseris meningitidis; tetracycline resistance 
in Staphyl ococcus aureus . Candida albicansTPseudomonas aeruginosa . Streptococcus pyogenes, or Neis- 
seria gonorrhoeae; Sidamino-glycoside resistance in Mycobacterium tuberculosis can be determined. 

[n these methods a polynucleotide Is prepared which is complementary to the nucleic acid sequence 

which characterizes the organism or its antibodic resistance and which additionally includes one or more 
modified nucleotides according to this invention. This polynucleotide is hybridized with nucleic acid 
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obtained from the organism under scrutiny. Failure to hybridize indicates absence of th organism or of the 
resistance characteristic. Hybridized nucleic acid duplexes are then identified by forming a complex 
between the duplex and a suitable polypeptide which carries a detectable moiety, and detecting the 
presence of the complex using an appropriate detection technique. Positive detection indicates that the 
complex, the duplex and therefore the nucleic acid sequence of interest are present. 

This approach can be extended to the diagnosis of genetic disorders, such as thalassemia and sickle 
cell anemia. The deoxyribonucleotide acid gene sequence whose presence or absence (in the case of 
thalassemia) is associated with the disorder can be detected following hybridization with a polynucleotide 
probe according to this invention based upon complex formation with a suitable detectable polypeptide. 

The mapping of genes or their transcripts to specific loci on chromosomes has been a tedious and 
time-consuming occupation, involving mainly techniques of cell-fusion and somatic cell genetics. Although 
in situ hybridization has been employed successfully for mapping single-copy gene sequences in species 
thatTTndergo chromosomes polytenization, such as Drosophila , detection of unique sequence genes in most 
hiqher eukaryotic chromosomes has been extremely difficult, if not impossible, using standard hybndization 
methods The necessity for polynucleotide probes of very high specific radioactivity to facilitate autoradiog- 
raphic localization of the hybridization site also results In rapid radiodecomposition of the probe and a 
concomitant increase in the background noise of silver grain deposition. The use of hybridization probes 
with low to moderate specific radioactivities requires exposure times or many days or weeks, even to detect 
multicopy sequences, such as ribosomal RNA genes or satellite DNA. Since recombinant DNA technology 
has made feasible the molecular cloning of virtually every single-copy sequence found in eukaryotic cells, it 
would be extremely beneficial to have a rapid and sensitive method for mapping the chromosomal ongin of 
such cloned genomic fragments. 

Modified nucleotides may be used in a method of gene map ping by In srtu hybridization wh ch 
circumvents the use of radioisotopes. This procedure takes advantage of a thymidine analogue containing 
biotin that can be incorporated enzymatJcally into DNA probes by nick translation. After hybridization in srtu 
the biotin molecules serve as antigens for affinity purified rabbit arrfi-biotin antibodies. Immunofluorescent 
antibody sandwiches made with fluorescein-labeled goat anti-rabbit IgG allow for rapid and specific 
cytogenetic localization of cloned gene sequences as green-yellow bands. This method offers four major 
advantages over conventional autoradiographic methods of in situ gene localization; less background noise 
an increase in resolving power between bands; a decrease in the time required to determine the site of 
probe hybridization; and chemically stable hybridization probes. This method has been applied successful I y 
to the localization of reiterated and unique DNA sequences in the polytene chromosome of Drosophila 
milanogaster and satellite DNA on mouse metaphase chromosomes. 

— Thus it h as been found that polytene chromosomes could be used as a test system for establishing the 
efficacy of probes using the modified nucleotides according to the instant invention as detected by indirect 
immunofluorescence for in situ gene mapping. The probes included a variety of clones Drpsophjla 
sequences obtained form OttoTchmidt and Dieter Soil, such as tRNA genes cloned in plasmid vectors with 
inserts of sizes ranging from about 5 to about 22 kilobases. Many of these clones have already been 
assigned to specific bands on the Drosphlla chromosome map by conventional in situ hybridization 
methods employing radioisotopes. j~™ 

DNA probes were nick translated in the presence of Bio-dUTP. Occasionally 3 H dATP and/or 3 H dCTP 
was included in the nick translation reaction mixture. This allowed both autoradiographic and im- 
munofluorescent localization of a sequence on a single chromosome spread. In sito hybridization was 
performed as described in M.L. Pardue. and J.G. Gall. Methods in Cell Biol.. 10, 1 (1975). After the final 2 x 
SSC wash to remove unhybridized probe, the slides were rinsed with PBS (phosphate buffered saline) and 
incubated at 37* C with 2.5 u. g/ml Rabbit anti-biotin in PBS and 10 mg/ml BSA for 2-16 hours. This was 
followed by incubation of the slides with FITC labeled Goat anti-Rabbit IgG (Miles Laboratories, diluted 
1:100 in PBS and 10 mg/ml BSA) for one-four hours. Evans Blue was often required as a red counterstain 
to see the chromosomes with fluorescent illumination. 

When plasmids pBR 17D and pPW 539 containing 5 Kb and 32 Kb inserts, respectively, were 
hybridized by this method, it was found that the pattern of hybridization is reproducible from spread to 
spread and is observed unambiguously on greater than 90% of the chromosome spreads on a given slide. 

The cloned transposable element pAC 104 is known to map at many sites along the Drosophila 
qenome. Comparison of the autoradlograph and the fluorescent picture obtained by in situ hybndizarjon of 
this probe illustrates a major advantage of this method, i.e.. that where diffuse regions of silver grains 
appear on an autoradiograph. doublets or a series of bands are discernible by immunofluorescent labeling. 

The other immediately obvious advantage of this method is the tremendous decrease in time required 
for gene assignments to be made by indirect immunofluorescence. An assignment of a DNA fragment to a 
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specific band can be made within six hours of hybridization. This is in comparison to days or weeks 
required for autoradiographic exposure methods. This factor in combination with increased resolution, 
makes the use of modified nucleotides detected by indirect immunofluorescence immediately preferable to 
more classical methods. 

5 It has been shown that this immunological method also works with mammalian chromosomes wherein 
satellite DNA has been mapped to the centromeric regions of mouse metaphase chromosomes. The result 
provides a basic foundation for the development of a simple gene mapping procedure for single copy 
(unique) sequences in chromosomes from human and other mammals. Such a procedure should greatly 
facilitate our understanding of the genetic organization of the chromosome and make clinical cytogenetic 

io diagnosis much more rapid and practical. 

While a single-step "antibody sandwich" method in which the chromosome spread is challenged, post- 
hybridization, with rabbit antj-biotin IgG may succeed, this protocol may not generate sufficient fluorescence 
for unambiguous gene assignments. However, a much stronger fluorometric signal can be achieved by 
using the "haptene-antibody sandwich technique", described by Lamm, et al., (1972); Wofsy. et al., (1974). 

75 In this procedure the primary antibody, in our case monospecific, rabbit anti-biotin IgG, is chemically 
modified with a haptenization reagent, such as 2, 4-dinitrofluorobenzene. preferably while the immuno- 
globulin is bound to an antigen affinity column (Wotin-Sepharose TM). As many as 15-20 haptene (DNP) 
groups can be coupled to the primary antibody without decreasing its antigen binding affinity or specificity 
(Wallace and Wofsy, 1979). If the primary antibody treatment of the test sample is followed by an 

20 incubation with a fluorescently labeled anti-hapten IgG antibody, rather than a fluorescently labeled anti-lgG. 
a 5-7 fold increase in fluorescence signal can be achieved. Since one also has available monospecific 
guinea pig anti-DNP IgG. we can haptenize this secondary antibody with biotin and thus generate two anti- 
hapten IgG populations, DNP-labeled anti-biotin IgG and biotin-labeled anti-DNP IgG. If these can be used 
alternately to achieve several rounds of hapten-antibody sandwiching and then followed with fluorescently 

25 labeled protein A from Staphylococcus aureus , which binds specifically to IgG molecules from many 
mammalian species, it could result in an enormous amplification of the primary antibody signal with its 
concomitant utility. 

The protein streptavidin from Streptomyces avidini is a potential alternative to anti-biotin igQ as a 
vehicle to specifically direct a coupled visualization system [e.g.. fluorescent probes (above) or histochem- 
30 ical reagents (below)] to the site of the hybridized biotin-containing polynucleotide. One of streptavidin's 
advantages over anti-biotin IgG is that its affinity for biotin is Kassn = 10" whereas association constants for 
haptene-IgG interactions are 10 7 to 10 10 . The fast reaction rate and extreme affinity mean that the time 
required to localize the biotinized probe will be minutes with streptavidin versus hours with immunologic 
reagents. 

as Initial evaluations of a streptavidin detection system are currently in progress. Polytene chromosomes 
hybridized with biotinized DNA probes will be incubated with streptavidin followed by a subsequent 
incubation with bovine serum albumin which has been doubly labeled with biotin and FITC (FITC, biotmyl- 
BSA). Since only one of the four streptavidin subunits is likely to be involved in binding at each biotinized 
DNA site, potentially one labeled BSA molecule can bind to each of the remaining three nonconjugated 

40 subunits of the streptavidin-biotinyl nucleotide complex. The fluorescence signal from this single strep- 
tavidin + FITC, biotinyl-BSA layer will be compared with a control using the basic "antibody sandwich 
mBthod" described earlier. 

If the "antibody sandwich" and streptavidin + FITC. biotinyl-BSA detection intensities are comparable, 
one can attempt to enhance the streptavidin + FITC, biotinyl-BSA system to single-copy sensitivity in a 

45 manner that parallels the multiple "haptene-antibody sandwich" approach. Since some of biotin groups on 
BSA will not be bound to the first layer of streptavidin, a second layer of streptavidin can be added until 
sufficient signal is obtained. For example, if in the second layer, only two streptavidin protomers bind to 
each first-layer BSA and each of these streptavidin protomers binds three RTC-biotinyl BSA molecules, 
then the second layer intensity will be twice as great as that from the first layer; for the third layer, with 

so analogous binding stolchiometries, the fluorescent intensity will be 12-fold that of the first layer, so the total 
intensity will rapidly increase with successively added layers. 

There are plans to use a larger carrier protein such as thyroglobulin rather than BSA in order to 
maximize amounts of attached fluorescent and biotin probes. It may also be necessary to use a longer 
linker arm between the biotin probe and the carrier protein. A longer linker arm should sterically optimize 

55 the theoretical delivery of a biotinized fluorescent carrier molecule to each nonconjugated streptavidin 
subunit and maximize the number of streptavidin protomers in th subsequent layer which will bind to the 
biotinized fluorescent carrier. As before, appropriate controls will be done to insure that substitution of the 
carrier protein with fluorescent probes and biotin does not cause solubility and/or nonspecific binding 
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problems. 

The streptavidin-carrier protein delivery system has two significant advantages over the imnnunofluores- 
cent approach in addition to its speed of delivery. First, only two protein components are needed to form 
the layers. Second, only the carrier protein needs to be modified and it is not necessary to maintain 
functional or even total struc tural integrity as long as the biotin groups are accessible to streptavidin. 

An alternative to the fluorescence method for visualizing hybridized probes is to direct enzymes such 
as peroxidase, alkaline phosphatase of p -galactosidase to the hybridization site where enzymatic conver- 
sion of soluble substrates to insoluble colored precipitates permits light microscope visualization. The 
important advantage of this technique is that the histochemical methods are 10 to 100-fold more sensitive 
than fluorescence detection. In addition, the colored precipitates do not bleach with extensive light exposure 
thus avoiding one of the general disadvantages of fluorescent light microscopy. These enzymes can be 
coupled to the final antibody instead of fluorescent probes in the "haptene-antibody sandwich" technique 
using Afunctional reagents such as glutaraldehyde or in the case of peroxidase via oxidation of the 
peroxidase carbohydrate moieties to aldehydes and coupling of these residues with € -amino groups of the 
desired protein. For the streptavidin-biotinized carrier protein method, an enzyme with biotinyl groups 
coupled to it could replace a fiuorescently-biotinized carrier system. Alternately, the enzyme could be 
coupled via biotin to the last layer of streptavidin with amplification of streptavidin sites being built up in 
preceding layers using biotinized BSA or thyreoglobulin. We will begin developing the necessary histochem- 
ical reagents and the appropriate substrate/insoluble product combinations for visualizing in sto hybridiza- 
tions without background problems in the near future. The histochemical approaches to signal amplification 
should therefore be ready for trial in the summer of 1981 . 

Detecting and/or imaging very low levels of fluorescent light is possible using currently available image 
intensifies or systems composed of lasers and photomultipliers. These methods permit the detection of 
light down to the level of individual photons. With suitable digital processing systems, images can be 
produced in which each point, i.e. each pixel, of the image is strictly proportional to the number of photons 
emitted by a point at the object. Using systems of this kind or flow systems in which the cells or parts of 
ceils flow past a laser beam, one can obtain detection sensitivity increases for fluorescent material of factors 
between 100 and 1000 beyond that which can be detected by the eye. This increase is sufficient to detect 
the fluorescence of single copy genes. 

In a preferred modification, analogs of dUTP and UTP that contain a biotin molecule covalently bound 
to the C-5 position of the pyrimidine ring through an allylamine linker arm have been synthesized. These 
biotinyl-nucleotides are efficient substrates for a variety of DNA and RNA polymerases in vitro. DNA 
containing low levels of biotin substitution (50 molecules or less/kilobase) has denaturation, reassociation 
and hybridization characteristics which are indistinguishable from that of unsubstituted control DNA. 

Thus, this invention also provides a method of chromosomal karyotyping. In this method, modified 
polynucleotides are prepared which correspond to known genes and include modified nucleotides. These 
polynucleotides are hybridized with chromosomal deoxyribonucleic acid and the resulting duplexes con- 
tacted with appropriate polypeptides under suitable conditions to permit complex formation. The polypep- 
tides include detectable moieties so that the location of the complexes can be determined and the location 

of specific genes thereby fixed. 

Another embodiment of this invention involves detection of poly A-containing sequences using poly U in 
which some of the uracil bases have been modified to contain a probe. Yet another embodiment involves 
cyclic modified nucleot ides in which two of x, y and z are reacted to form the cyclic moiety 

I I 

V. 

O OH 



Such cyclic modified nucleotides may then be used to identify hormone receptor sites on cell surfaces 
which in turn can be used as a method of detecting cancer or tumor cells. 

Finally tumor cells can be diagnosed by preparing polynucleotides which are modified according to this 
invention and are complementary to the messenger ribonucleic acid synthesized from a deoxyribonucleic 
acid gene sequence associated with the production of polypeptides, such as a-fetal protein or caranoem- 
bryonic antigen, the presence of which is diagnostic for specific tumor cells. Hybridization and detection of 
hybrid duplexes thus would provide a method for detecting the tumor cells. 
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The examples which follow are set forth to illustrate various aspects of the present invention but are not 
intended to limit in any way its scope as more particularly set forth in the claims. 



Example 1 and 2 



Synthesis of biotinyl - UTP and biotinyl - dUTP 



a) Preparation of Mercurated Nucleotides 

UTP (570 mg, 1.0 mmole) or dUTP 554 mg, 1.0 mmole) was dissolved in 100 ml of 0.1 M sodium 
acetate buffer pH 6.0, and mercuric acetate (1.59 gm, 5.0 mmoles) added. The solution was heated at 50 C 
for 4 hours, then cooled on ice. Lithium chloride (392 mg, 9.0 mmoles) was added and the solution 
extracted six times with an equal volume of ethyl acetate to remove exess HgCfe. The efficiency of the 
extraction process was monitored by estimating the mercuric ion concentration in the organic layer using 4, 
4-bis (dimethylamino)-thiobenzophenone (A.N. Christopher, Analyst, 94, 392 (1969). The extent of 
nucleotide mercuration. determined spectrophotometrically following iodination of an aliquot of the aqueous 
solution as described by Dale et al. (R.M.K. Dale, D.C. Ward, D.C. Livingston, and E. Martin, Nucleic Acid 
Res. 2. 915 [1975]). was routinely between 90 and 100%. The nucleotide products in the aqueous layer, 
whichoften became cloudy during the ethyl acetate extraction, were precipitated by the addition of three 
volumes of ice-cold ethanol and collected by centrifugation. The precipitate was washed twice with cold 
absolute ethanol, once with ethyl ether, and then air dried. These thus prepared mercurated nucleotides 
were used for the synthesis of the allylamine-nucleotides without further purification. 



b) Synthesis of allylamine - dUTP and allylamlne • UTP 

The mercurated nucleotides (of step a) were dissolved in 0.1 M sodium acetate buffer at pH 5.0, and 
adjusted to a concentration of 20mM (200 OD/ml at 267 nm). A fresh 2.0 M solution of allylamine acetate in 
aqueous acetic acid was prepared by slowly adding 1.5 ml of allylamine (13.3 mmoles) to 8.5 ml of ice-cold 
4 M acetic acid. Three ml (6.0 mmoles) of the neutralized allylamine stock was added to 25 ml (0.5 mmole) 
of nucleotide solution. One nucleotide equivalent of KaPdCU, (163 mg, 0.5 mmole). dissolved in 4 ml of 
water was then added to initiate the reaction. Upon addition of the palladium salt (AKa-Ventron) the soution 
gradually turned black with metal (Hg and Pd) deposits appearing on the walls of the reaction vessel. After 
standing at room temperature for 18-24 hours, the reaction mixture was passed through a 0.45 mm 
membrane filter (nai gene) to remove most of the remaining metal precipitate. The yellow filtrate was diluted 
five-fold and applied to a 100 ml column of DEAE-Sephadex TM A-25 (Pharmacia). After washing with one 
column volume of 0.1 M sodium acetate buffer at pH 5.0, the products were eluted using a one liter linear 
gradient (0.1-0.6 M) of either sodium acetate at pH - 8-9, or triethylammonium bicarbonate (TEAB) at pH 
7.5. The desired product was in the major UV-absorbing portion which eluted between 0.30 and 0.35 M salt 
Spectral analysis showed that this peak contained several products, final purification was achieved by 
reverse phase -HPLC chromatography on columns of Partisil - ODS2. using either 0.5M NH*H 2 PO«. buffer 
at pH 3.3 (analytical separations), or 0.5 M triethylammonium acetate at pH 4.3 (preparative separations) as 
eluents. The 5 -triphosphates of 5-(3 -aminopropen-l-yl) uridine (the allylamine adduct to uridine) were the 
last portions to be eluted from the HPLC column and they were clearly resolved from three, as yet 
uncharacterized, contaminants. These nucleotides were characterized by proton NMR elemental analysis 
tAA-dUTP (C, 2 H, 6 N 3 Ou Ps Na*.1 H 2 0): theory C, 22.91; H, 2.88; N, 6.68; P, 14.77. Found. C. 23.10; H. 
2.85; N. 6.49; P, 14.75. AA-UTP (Ci Z H, 6 Na Ois Pa Na+ .4H 2 0): Theory . C 20.61; H. 3.46; N, 6.01; P. 
13.3'. Found C, 20.67; H, 4.11; N. 5.39; P. 1354] spectrally and chromatographically. 

c) Biotination of AA-dUTP or AA-UTP 

Biotjnyl-N-hydroxysuccinimide ester (NHSB) was prepared from biotin (Sigma) as described previously 
(H. Heitzmann and F.M. Richards. Proc. Natl. AcadScL USA. 71. 3537 [1974]). AA-dUTP.rfeO (63 mg. 0.1 
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mmole) or AA-UTP«4H 2 0 (70 mg, 0.1 mmote) was dissolved in 20 ml of 0.1 M sodium borate buffer at pH 
8 5, and NHSB (34.1 mg, 0.1 mmole) dissolved in 2 ml of dimethyl formamide, was added. The reaction 
mixture was left at room temperatur for four hours and then loaded directly onto a 30 ml column of DEAE- 
Sephadex TM A-25, preequilibrated with 0.1 M TEAB at pH 7.5. The column was eluted with a 400 ml linear 
gradient (0.1-05 M) of TEAB. Fractions containing biotinyl-dUTP or biotinyl-UTP, which eluted between 0.55 
and 0.65 M TEAB, were desalted by rotary evaporation in the presence of methanol and redissolved in 
water Occasionally a slightly cloudy solution was obtained: this turbidity, due to a contaminant in some 
TEAB solutions, was removed by filtration through a 0.45 mm filter. For long term storage, the nucleotides 
were converted to the sodium salt by briefly stirring the solution in the presence of Dowex TM 50 (Na 
form). After filtration the nucleotide was precipitated by the addition of three volumes of cold ethanol, 
washed with ethyl ether, dried In vacuo over sodium hydroxide pellets, and stored in a dessicator at -20 C. 
For immediate use. the nucleotide solution was made 20 mM in in Tris-HCI at pH 7.5, and adjusted to a 
final nucleotide concentration of 5 mM. Stock solutions were stored frozen at -20 C. 

Elemental analysis of the bio-dUTP and bio-UTP products yielded the following results. Bio-dUTP (C 22 
H 30 N S 0,8 Pa Si Na*« 1 H 2 0). Theoretical; C, 29.80; H, 3.38; N, 7.89; P. 10.47; S. 3.61. Found; C, 30.14 
H322; N.7.63; P, 10.31; S, 3.70. Bio-UTP {C22 H30 N 5 Ots Ps St Na*«3 H 2 0): Theoretical: C, 29.15; H. 
3 19; N, 7!45; P, 9.89; S. 3.41. Found; C. 2a76; H, 3.35; N, 7.68; P, 9.81; S. 3.32. 

The spectral properties of bio-dUTP and bio-UTP at pH 7.5 [ X max, 289 nm ( e = 7,100): X max. 240 
nm ( « = 10.700); X min, 262 nm ( « = 4,300)] reflect the presence of an exocylic double-bond in 
conjugation with the pyrimidine ring. These nucleotides also give a strong positive reaction (an orange-red 
color) when treated with p-dimethylaminocinnamaldehyde in ethanolic sulfuric acid, a procedure used for 
biotin quantitation (D.B. McCormick and JA Roth, Anal. Biochem., 34, 326. 1970). However, they no longer 
react with ninhydrin. a characteristic reaction of the AA-dUTP and AA-UTP starting materials. 



Examples 3 and 4 



Synthesis of biotinyl-CTP and biotinyl-dCTP 

OTP and dCTP were a) mercurated, b) reacted with allylamine, and c) biotinized with NHS-biotin, 
essentially as described in Example 1. CTP (56.3 mg. 0.1 mmole) or dCTP (59.1 mg. 0.1 mmole) were 
dissolved in 20 ml of 0.1 M sodium acetate buffer at pH 5.0. and mercuric acetate (0.159 gm 0.5 mmoles) 
added. The solution was heated at 50* C for 4.5 hours then cooled on ice. Lithium chloride (39.2 mg. 0.9 
mmoles) was added and the solution extracted 6 times with ethyl acetate. The nucleotide products in the 
aqueous layer were precipitated by the addition of three volumes of cold ethanol and the precipitate 
collected by centrifugation. The precipitate was washed with absolute ethanol. ethyl ether, and then air 
dried. These products were used without further purification for the synthesis for AA-CTP and AA-dCTP. 
respectively. The mercurated nucleotides were dissolved in 0.1 M sodium acetate buffer at pH 5.0 and 
adjusted to a concentration of 10 mM (92 OD/ml at 275 nm). 0.6 ml (1.2 mmole) of a 2.0 M allylamine 
acetate stock (prepared as described in Example 1) was added to 10 ml of nucleotide solution (0.1 mmole) 
followed by the addition of fcPdCU (32.6 mg, 0.1 mmole), dissolved in 1.0 ml of H 2 0. After standing at 
room temperature for 24 hours, the solution was filtered through a 0.45 mM membrane to remove metal 
precipates. The filtrate was diluted five-fold and loaded onto a 50 ml column of DEAE-sephadex A-25, 
preequilibrated with 50 mM TEAB at pH 7.5. The nucleotide prod ucts were fractionated by application of a 
500 ml linear gradient (0.05-0.6 M) of TEAB at pH 7.5. The desired product was In the major UV absorbing 
portion which eluted between 0.28 and 0.38 M salt. The pooled samples were desaited by rotary 
evaporation, dissolved in 0.5 M triethylammonium acetate at pH 4.2, and final purification achieved by 
HPLC chromatography on columns of Partisil ODS-2. using 0.5 M triethylammonium acetate as the eluent. 
Appropriate fractions were pooled, lyophilized. and the products dissolved in H 2 p. The nucleotides were 
converted to the Na* salt by stirring briefly in the presence of Dowex TM 50 (Na form). After filtration, to 
remove the Dowex resin, the nucleotides were precipitated by the addition of 3 volumes of cold ethanol. 
The precipitate was washed with ether and then air dried. Analytical results: AA-dCTP (C, 2 H, 7 N* O13 Ps 
Na* • 2H 2 0); Theory. C. 22.29; H, 2.63; N. 8.67; P, 14.40. Found C, 22.16; H, 2.89; N, 8.77; P, 14.18. AA- 
CTP (C, 2 H, 7 N* O,* Na* • 2H 2 0); Theory C, 21.75; H, 2.57; N, 8.46; P, 14.01. Found. C, 22.03; H. 2.47; 
N, 8.69; P, 13.81; Spectral properties in 0.1 M Borate buffer at pH 8.0, X max 301 nm ( « =6,400), X min 
271 nm '( e" =3.950) X max 250 nm ( e =9,700). Both AA-dCTP and AA-CTP give a positive ninhydrin test. 
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AA-CTP (66 mg 0.01 mmole) or AA-dCTP (6.4 ing, 0.01 mmole) was dissolved in 5 ml of 0.1 M sodium 
borate buffer at pH 8.5, and NHS-biotin (3.4 mg, 0.01 mmole). dissolved in 0.2 ml of dimethylformamide, 
was added After sitting at room temperature for 4 hours the sample was chromatographed on a 10 ml 
column of DEAE-Sephadex A-25, using a 150 ml linear gradient (0.1-0.9 M) of TEAB at pH 7.5, as eluent. 
Fractions containing biotinyl-CTP or biotinyl-dCTP, which eluted between 0.50 and 0.60 M TEAB were 
pooled desalted by rotary evaporation, and after being adjusted to a final concentration of 5 mM in 0.02 M 
Tris-HCI buffer at pH 7.5, were frozen at -20* C. The products give a strong positive reaction for biotin with 
p-dimethylaminocinnamldehyde in ethanolic sulfuric acid but give and negative test for primary amines 
when sprayed with ninhydrin. Further structural characterization of these products is in progress. 



Examples 5 and 6 



Synthesis of Iminoblotlnyl-UTP and Iminobiotinyl-dUTP 

Iminobiotin hydrobromide was prepared from biotin as described previously (K. Hofmann, D.B. Melville 
and V du Vigneaud, J. Biol. Chem., 141. 207-211. 1941; K. Hofmann and A.E. Axelrod, Ibid.. 187. 29-33, 
1950) The N-hydroxysuccinimide (NHS) ester of iminobiotin was prepared using the, protocol previously 
described for the synthesis of NHS-Biotin (H. Heitanann and F.M. Richards. Proc. Nat Acad. Sc.. USA. 71_. 
5537 1974) AA-UTP (7.0 mg, 0.01 mmole) or AA-dUTP (6.3 mg. 0.01 mmole). prepared as detailed in 
example 1 (part b), was dissolved in 5 ml of 0.1 M sodium borate buffer at pH 8.5. and NHS-iminobiotin (3.5 
mg 001 mmole). dissolved in 0.5 ml of dimethytformarnide, was added. The reaction mixture was left at 
room temperature for 12 hours and then loaded directly onto a 10 ml column of DEAE-Sephadex A-25, 
preequilibrated with 0.05 M TEAB at pH 7.5. The column was eluted with a 150 ml linear gradient (0.05-0.6 
M) of TEAB. Fractions containing iminobiotin-UTP or iminobiotJn-dUTP, which eluted between 0.35 and 040 
M TEAB were desalted by rotary evaporation in the presence of methanol and dissolved in H 2 0. The 
products'contained a small amount of allylamine-nucleotide adduct as an impurity, as judged by a weak 
positive result in the ninhydrin test Final purification was achieved by affinity chromatography on avidin- 
sepharose. Fractions of the impure product made 0.1 M in sodium borate buffer at pH 8.5. were applied to 
a 5 ml column of avidin-sepharose and washed with 25 ml of the same buffer. The column was then 
washed with 50 mM ammonium acetate buffer at pH 4.0. which eluted the desired iminobiotin-nucleot.de 
product in a sharp peak. The nucleotide was precipitated by the addition of 3 volumes of cold ethanol, 
washed with ethylether. dried in vacuo over sodium hydroxide pellets and stored in a dessicator at -20 C. 
Products were characterized by"elemental analysis, as well as by spectral and chromotographic properties. 



Examples 7 and 8 



Synthesis of NAGE-UTP and NAGE-dUTP 

Allyl (3-amino-2-hydroxy-)propyl ether, abbreviated NAGE, was prepared from allyl glycidyl ether (Age) 
(obtained from Aldrich Chemical Co.). 10 ml of Age (84 mmole) was added slowly (in a fume hood) to 50 ml 
of 9 M ammonium hydroxide and the mixture allowed to stand at room temperature for six hours. Excess 
ammonia was removed by rotary evaporation under reduced pressure to yield a viscous yellow oil. Analysis 
of this product by proton NMR showed that it possessed the required structure. 5-mercun-dUTP (0.1 
mmole) or 5-mercuri-UTP (0.2 mmole) was dissolved in 2-4 ml of 0.2 M sodium acetate buffer at pH 5.0, 
and a 16 fold molar excess of NAGE adjusted to pH 5.0 with acetic acid prior to use. was added. The final 
reaction volumes (4.3 and 8.4 ml) had nucleotide concentrations of 43 and 42 mM, respectively. One 
equivalent of fcPdCI* (0.1 or 0.2 mmoles) was added to initiate the reaction. After standing at room 
temperature for 18 hours, the reaction mixtures were filtered through 0.45 u mM membranes the samples 
diluted five-fold, and chroma tographed on columns of DEAE-Sephadex A-25. using linear gradient (0.1-0.6 
M) of sodium acetate. Fractions containing the desired products, as judged by their UV spectra and 
characteristic HPLC elutlon profiles on Partisll ODS-2, were pooled, diluted, and further purified by 
rechromatography on DEAE-Sephadex using shallow gradients (0.1-0.5 M) of ammonium bicarbonate atpH 
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85 Under these conditions the majority of the NAGE-dUTP (or NAGE-UTP) could be cleanly separated 
from residual impurities. Proton NMR spectra were obtained at this stage of purification after the nucleotides 
were lyophilized and redissolved in D a O. For elemental analysis, the products were converted to their 
sodium salt form. Typical analytical results: NAGE-dUTP (C, s H 22 N 3 Ois P 3 Na* • 2 H 2 0). Theory, C, 
2499" H, 3.63; N, 5.83; P, 12.88. Found, C, 25.39; H, 3.71; N. 5.63; P, 12.88. 



Example 9 



Uses of Labeled DNA Sequences 



I. Karyotyping 

(a) Select from a human gene library some 100 to 200 clones. Label them as described above, and 
for each clone locate its place or places of hybridization visually or with a low-light-level video system. For 
those clones which correspond to a unique sequence gene this determines the location of the cloned DNA 
on a particular human chromosome. Obtain several clones for each chromosome. Each of those labeled 
clones can be used to identify particular chromosomes. They can also be used in combination to identify 
each of the 46 chromosomes as being one of the 22 autosomal pairs of the X or the Y. By allowing one set 
of labeled clones to hybridize to the chromosomes and then adding a fluorescent stain to the label, the set 
of clones and their locations can be visualized and will fluoresce with a particular color. A second set of 
labeled clones could then be used and reacted with a second fluorescent dye. The same process can be 
repeated a number of times. Thus one can. if desired, have several sets of fluorescent labels attached to 
the cellular DNA at different but specific locations on each of the chromosomes. These labels could be 
used for visual or computerized automatic karyotyping. 

(b) For automatic karyotyping, one could use one set of clones to identify the approximate location of 
each of the 46 chromosomes by finding sets of spots corresponding to the number of labeling sites on each 
chromosome. Thus, it is possible by computer analysis of the digitized images to determine if the 
chromosomes are suitably spread for further analysis. If they are suitably spread, then one can use 
computer analysis to identify each of the individual chromosomes by the location and distribution of the 
labelled spots on each one. 

By using the fact that the fluorescent spots can be placed at specific locations on each chromosome, 
one can carry out either manual or automatic karyotyping very much more effectively than without such 
labels. 



II. Diagnosis of Genetic Disorders 

By selecting the clones which bind specifically to a particular chromosome, such as number 23. it is 
possible to count the number of copies of the particular chromosome in a cell even if the chromosomes are 
not condensed at metaphase. Thus when fetal cells are obtained for prenatal diagnosis of trisomy 21. the 
diagnosis can be done even if the chromosomes are not condensed at metaphase. If necessary, two sets of 
labels can be used - one which would be specific for chromosome 23 and one for some other chromosome. 
By measuring in each cell the ratio of the two labels, which might be of different colors, it is possible to 
identify the cells which show an abnormal number of chromosomes number 23. This procedure could be 
used either on slides with a low-light-level video system or in a flow cytometer system using laser 
excitation. It can be used to determine any abnormal chromosome number. 



III. Microorganism Detection and Identification 

The labeling of specific sequences of DNA as described above permits identification and counting 
individual bacteria. In order to identify the individual bacteria to which a particular fragment of Dl 
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hybridizes the sensitivity must be such that a single labelled structure can be detected. This can be done 
using a low-lighHevel video system and computer summation of images, or by using some other devic for 
intensifying the light image. A flow system can also be used if the sensitivity can be made sufficiently 
grand. If one immobilized the bacteria on a slide their location could be found and the number of 
fluorescent spots counted. This would provide a count of all of those bacteria which contain DNA which can 
hybridize with the specific clone utilized. If the clone is selected as being specific for a particular strain of 
bacteria, then one can count the number of organisms of that strain. In addition, any antibiotic resistance for 
which a particular gene has been identified could be characterized in a similar way using, as a probe, the 
DNA sequence which is contained in the antibiotic resistance gene. In addition, a probe could be used 
which is specific for a resistance plasmid containing one or more antibiotic resistance genes. In addition to 
individual bacteria, groups of bacterial cells of a particular strain can be detected and their number 
estimated if they are located in a small spot so that the total fluorescence specific to the hybridized DNA in 
the spot can be measured. In this way the number of organisms containing a specific DNA sequence can 
be measured in a mixture of bacteria. 



Claims 

1. Ribose- and 2-deoxyribose compounds having the general formula: 




wherein B represents a 7-deazapurine or a pyrimidine moiety covalently bonded to the C 1 -position of the 
sugar moiety, provided that when B is a 7-deazapurine, the sugar moiety is attached at the N 9 -position of 
the deazapurine, and when B is a pyrimidine. the sugar moiety is attached at the N'-position of the 
pyrimidine; 

wherein A represents a ligand containing at least three carbon atoms; 

wherein the dotted line represents a linkage group joining B and A. the linkage group including carbon- 
carbon single bonds, carbon-carbon double bonds, carbon-nitrogen single bonds or carbon-oxygen single 
bonds, provided that if B is a 7-deazapurine. the linkage group is attached to the 7-position of the 
deazapurine, and if B is a pyrimidine. the linkage group is attached to the 5-position of the pyrimidine; 
wherein X represents a moiety selected from: 



HO-j»-, HO-jl-O-P-O-, or HO-I-0- 



OH OH OH 




and 

wherein Z is -OH or -H. . 

2. The compound in accordance with claim 1, wherein B is a uracil, cytosine, deazaadenine or 

deazaguanine moiety. ... 

3. The compound in accordance with claim 1 wherein the dotted line represents a linkage group joining 
B and A. the linkage group including an olefinic bond at the a position relative to B and/or a -CH 2 -HN- 
moiety. provided that if B is a 7-deazapurine, the linkage group is attached to the 7-position of the 
deazapurine, and if B is a pyrimidine. the linkage group is attached to the 5-position of the pyrimidine. 
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4. The compound in accordance with claim 1, wherein the linkag group is characterized by a moiety 

selected from 

-CH = CH-CH 2 -NH- or 

-CH=CH-CH 2 -0-CH 2 - CH -CH 2 -NH-. 

5 Oh 

5. The compound in accordance with claim 1 wherein A represents a ligand containing at least five 

carbon atoms. . 

6. The compound in accordance with claim 1 wherein A represents a hapten containing at least three 

to carbon atoms. , 

7. The compound in accordance with claim 1 wherein A represents a hapten containing at least five 

carbon atoms. . 

8. The compound in accordance with claim 1 wherein A is selected from the group consisting of biotin; 

iminobiotin; 

15 



— C 




I -C-CHj-NH— / \-NO» 



20 



NOx 



25 



30 



35 



1 

-C-CHi-CHiC-O— j -C-(CHtf« I '; and 

9 HO 
O O O 

0 

II 



X). 



9. A poly- or oligonucleotide sequence which comprises at least one of a moiety having the general 
formula: 



40 



45 




< I* • • A 



50 



55 



wherein B represents a purine, a 7-deazapurine or a pyrimidine moiety covalently bonded to the C - 
position of the sugar moiety, provided that when B is a purine or a 7-dea2apurine, the sugar moiety is 
attached at the Imposition of the purine or deazapurine, and when B is a pyrimidine, the sugar moiety is 
attached at the N'-position of the pyrimidine; 
wherein A represent a ligand containing at least three carbon atoms; 

wherein the dotted line represents a linkage group joining B and A, said linkage group including carbon- 
carbon single bonds, carbon-carbon doubl bonds, carbon-nitrogen single bonds or carbon-oxygen single 
bonds, provided that if B is a purine, the linkage group is attached to the 8-position of the purine, if B is a 7- 
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deazapurine, the linkage group is attached to the 7-position of the deazapurine, and if B is a pyrimidine, the 
linkage group is attached to the 5-position of the pyrimidine; and 
wherein Z represents a moiety selected from: 

HO-jl-CK HO-|-oXo-, BoXoJ-oJ-O-, -OH or -H. 
OE OH OH OB OH OH 



10. The sequence in accordance with claim 9, wherein B is a cytosine, uracil, deazaadenine, or 

deazaguanine. . 

11. The sequence in accordance with claim 9, wherein the linkage group Is charactenzed by a moiety 

selected from 

-CH = CH-CH 2 -NH- or 

-CH = CH-CH 2 -0-CH2- CH -CH 2 -NH- . 

OH 

12. The sequence in accordance with claim 9, wherein 2 is -OH or -H. 

13. The sequence in accordance with claim 9 wherein the dotted line represents a linkage group joining 
B and A, the linkage group including an olefinic bond at the a position relative to B and/or a -CH 2 -H N- 
moiety, provided that if B is a 7-deazapurine, the linkage group is attached to the 7-position of the 
deazapurine, and if B is a pyrimidine. the linkage group is attached to the 5-position of the pyrimidine. 

14. The sequence in accordance with claim 9 wherein A represents a ligand containing at least five 

carbon atoms. , t 

15. The sequence in accordance with claim 9 wherein A represents a hapten containing at least three 

carbon atoms. 

16. The sequence in accordance with claim 9 wherein A represents a hapten containing at least five 

carbon atoms. . , 

17. The sequence in accordance with claim 9 where in A is selected from the group consisting of biotln; 

iminobiotin; 




L2 



-C-CHa-CHjC-O-f -C-(CHik— 1 U ind 

0 



1 



18. A poly- or oligonucleotide sequence according to any of claims 9 through 17, further comprising a 
sequence complementary to nucleic acid sequences associated with an etiological agent 

19. The sequence of claim 18 wherein the etiological agent is selected from the group consisting of 
bacteria, viruses, and fungi. 
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20. The sequence of claim 18 wherein the etiological agent comprises a bacterium, and wherein the 
sequence further comprises a polynucleotide complementary to a nucleic acid sequence which confers 
resistance to an antibiotic. 

21. The sequence of claim 20 wherein the bacterium is Streptococcus pyrogenes or Neisseria 
meningitidis and the antibiotic is penicillin. 

22. The sequence of claim 20 wherein the bacterium is Staphylococcus aureus , Candida albicans , 

Pseudornonas aeruginosa , Streptococcus pyrogenes, or Neisseria gonorrhoeae and the antibiotic is a 

tetracycline. . . 

23. The sequence of claim 20 wherein the bacterium is Mycobacterium tuberculosis and the antibiotic is 

an aminoglycoside. 

24. A poly- or oligonucleotide sequence according to any of claims 9 through 17, the sequence further 
comprising a sequence complementary to nucleic acid sequences associated with a genetic disorder. 

25. The poly- or oligonucleotide sequence of claim 24 wherein the genetic disorder is thalassemia. 

26. A poly- or oligonucleotide sequence according to any of claims 9 through 17 for karyotyping, the 
sequence further comprising a sequence complementary to nucleic acid sequences associated with 
particular chromosomes. 

27. A poly- or oligonucleotide sequence according to any of claims 9 through 17 for detecting a poly A 
sequence, the sequence further comprising a sequence of uracil moieties. 

28. A double-stranded RNA or DNA duplex or RNA-DNA hybrid which comprises: 

in one strand a poly- or oligonucleotide sequence which comprises at least one of a moiety having the 
general formula: 



wherein B represents a purine, a 7-deazapurine or a pyrimidine moiety covalently bonded to the Op- 
position of the sugar moiety, provided that when 8 is a purine or a 7-dea2apurine t the sugar moiety is 
attached at the N 9 -position of the purine or deazapurine, and when B is a pyrimidine. the sugar moiety is 
attached at the N'-position of the pyrimidine; 
wherein A represent a ligand containing at least three carbon atoms; 

wherein the dotted line represents a linkage group joining B and A, said linkage group including carbon- 
carbon single bonds carbon-carbon double bonds, carbon-nitrogen single bonds, or carbon-oxygen single 
bonds, provided that if B is a purine, the linkage group is attached to the 8-position of the purine, if B is a 7- 
deazapurine, the linkage group is attached to the 7-position of the deazapurine, and if B is a pyrimidine, the 
linkage group is attached to the 5-position of the pyrimidine; and 
wherein Z represents 



29. The duplex or hybrid in accordance with Claim 28, wherein Z is -OH or -H. 

30. The duplex or hybrid in accordance with Claim 28, wherein the dotted line represents a linkage 
group joining B and A, the linkage group including carbon-carbon single bonds, carbon-carbon double 
bonds, carbon-nitrogen single bonds or carbon-oxygen single bonds, provided that if B is a 7-deazapurine, 
the linkage group is attached to the 7-position of the deazapurine, and if B is a pyrimidine, the linkage goup 
is attached to the 5-position of the pyrimidine. 

31. The duplex or hybrid in accordance with Claim 28, wherein B is a pyrimidine or a 7-deazapurine. 

32. The duplex or hybrid in accordance with Claim 28, wherein B is a uracil, cytosine, deazaadenine, or 
deazaguanine. 





'OB or -H. 
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33. The duplex or hybrid in accordance with Claim 28, wherein the linkage group is characterized by a 

moiety selected from 
-CH = CH-CH 2 -NH- or 
-CH = CH-CH 2 -0-CH 2 - CH-CH2-NH-. 

« OH 

34. The duplex or hybrid in accordance with claim 28 wherein A represents a ligand containing at least 

five carbon atoms. . , . 

35. The duplex or hybrid in accordance with claim 28 where A represents a hapten containing at least 

70 three carbon atoms. , . . 

36. The duplex or hybrid in accordance with claim 28 wherein A represents a hapten containing at least 

five carbon atoms. . . 

37. A method for detecting the presence or absence of a target in a sample comprising the steps, 
a) contacting the sample with a compound having the structure: 
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wherein each of B, b\ and B* represents a purine, deazapurine, or oyrimidine moiety covalently bonded to 
the C 1 -position of the sugar moiety, provided that whenever B, B , or B is purine or deazapurine, it is 
attached at the Imposition of the purine or deazapurine, and whenever B, B , or B is pynmidme, it is 
attached at the N 1 -position; wherein A represents at least one component of a signalling moiety and 
consists of at least three carbon atoms; wherein B and A are attached directly or through a linkage group 
the linkage group not interfering substantially with the characteristic ability of B to hybndize with the target 
or of A to produce a detectable signal, wherein if B is purine, A is attached to the 8-posltion thereof, if B is 
deazapurine A is attached to the 7-position thereof, and if B is deazapurine, A is attached to the 7-positon 
thereof, and if B is pyrimidine A is attached to the 5-position thereof; wherein I, m and n are integers, 
provided that I and n are not simultaneously 0, .provided further that m is never 0 and wherein z represents 
H- or HO-; and 

b) detecting any signal associated with compounds hybridized to the target 

38 the method of detecting the presence or absence of a target in a sample of claim 37 wherein the 
step of contacting further comprises contacting the sample with a polypeptide capable of forming a complex 
with A and including a moiety which can be detected when the complex is formed. 

39. The method of claim 37 wherein the moiety A of said compound is selected from the group 
consisting of biotin, iminobiotin, 



55 



29 



EP 0 329 198 A2 



JO 



75 



20 



! i 8 



40. The method of claim 37 wherein the polypeptide is selected from the group consisting of avidin, 
streptavidin, and anti-A immunoglobulin. 

41. The method of claim 37 wherein A* is a hapten and the polypeptide is an antibody thereto. 

42. The method of claim 37 wherein A is a Hgand. 

43 The method of claim 3 wherein the moiety included with the polypeptide which can be detected is 
fluorescent electron dense, or is an enzyme capable of reacting with a substrate to form a detectable 

reaction product. . . . ... 

25 44. The method of claim 47 wherein the target is a nucleic acid sequence derived from a living 

organism. 
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